A FAST GENERAL METHODOLOGY FOR 
INFORMATION-THEORETIC ALLY OPTIMAL ENCODINGS OF 

GRAPHS* 

XIN HEt, MING- YANG KAO* AND HSUEH-I LU § 

Abstract. We propose a fast methodology for encoding graphs with information-theoretically 
minimum numbers of bits. Specifically, a graph with property ir is called a ir-graph. If tt satisfies 
certain properties, then an n-node m-edge 7r-graph G can be encoded by a binary string X such that 
(1) G and X can be obtained from each other in 0(n log n) time, and (2) X has at most f3(n) + o(/3(n)) 
bits for any continuous super-additive function f}(n) so that there are at most 2^( n ) +o( ^( n )) distinct n- 
node 7r-graphs. The methodology is applicable to general classes of graphs; this paper focuses on planar 
graphs. Examples of such 7r include all conjunctions over the following groups of properties: (1) G is 
a planar graph or a plane graph; (2) G is directed or undirected; (3) G is triangulated, triconnected, 
biconnected, merely connected, or not required to be connected; (4) the nodes of G are labeled with 
labels from {l,...,^} for l\ < n; (5) the edges of G are labeled with labels from {1,...,£ 2 } for 
\ £ 2 < m; and (6) each node (respectively, edge) of G has at most £ 3 — 0(1) self-loops (respectively, 

£4 = 0(1) multiple edges). Moreover, £ 3 and £4 are not required to be O(l) for the cases of tt being a 
plane triangulation. These examples are novel applications of small cycle separators of planar graphs 
and are the only nontrivial classes of graphs, other than rooted trees, with known polynomial-time 
information-theoretically optimal coding schemes. 
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1. Introduction. Let G be a graph with n nodes and m edges. This paper studies 
the problem of encoding G into a binary string X with the requirement that X can be 
decoded to reconstruct G. We propose a fast methodology for designing a coding scheme 
CNj ! such that the bit count of X is information-theoretically optimal. Specifically, a function 

(3{n) is super- additive if /3(ni) + ^(n^) < (3{ni + n 2 ). A function j3{n) is continuous if 
f3{n + o(n)) = /3(n) + o(/3(n)). For example, /3(n) = n c log d n is continuous and super- 
additive, for any constants c > 1 and d > 0. The continuity and super-additivity are 
closed under additions. A graph with property n is called a it -graph. If 7r satisfies 
certain properties, then we can obtain an X such that (1) G and X can be computed 
from each other in 0(n log n) time and (2) X has at most /3(n) + o((3(n)) bits for any 
continuous super-additive function (3{ri) so that there are at most 

2 /3(n)+o(/3(n)) distinct 

n-node m-edge 7r-graphs. The methodology is applicable to general classes of graphs; 
this paper focuses on planar graphs. 

A conjunction over k groups of properties is a boolean property 7Ti A • • • A 7Tfc, where 
7Tj is a property in the z-th group for each i — 1, . . . , k. Examples of suitable 7r for our 
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methodology include every conjunction over the following groups: 
Fl. G is a planar graph or a plane graph. 
F2. G is directed or undirected. 

F3. G is triangulated, triconnected, biconnected, merely connected, or not required 
to be connected. 

F4. The nodes of G are labeled with labels from {1, . . . , £{\ for £\ < n. 

F5. The edges of G are labeled with labels from {1, . . . , £2} for £2 < m. 

F6. Each node of G has at most £3 = 0(1) self-loops. 

F7. Each edge of G has at most £4 = 0(1) multiple edges. 
Moreover, £ 3 and £ 4 are not required to be 0(1) for the cases of 7r being a plane trian- 
gulation. For instance, n can be the property of being a directed unlabeled biconnected 
simple plane graph. These examples are novel applications of small cycle separators of 
planar graphs [PT],[I^|. Note that the rooted trees are the only other nontrivial class of 
graphs with a known polynomial-time information-theoretically optimal coding scheme, 
which encodes a tree as nested parentheses using 2(n — 1) bits in 0(n) time. 

Previously, Tutte proved that there are 2 /3 ^ + °^ l3 ^ m ^ distinct m-edge plane trian- 
gulations where (3{m) — (| — log 2 3)m + o(m) ~ 1.08m + o(m) |HJ an d that there are 
2 2m +°( n ) distinct m-edge n-node triconnected plane graphs that may be non-simple . 
Turan |l6j used 4m bits to encode a plane graph G that may have self-loops. Keeler and 
Westbrook [l(J improved this bit count to 3.58m. They also gave coding schemes for 
several families of plane graphs. In particular, they used 1.53m bits for a triangulated 
simple G, and 3m bits for a connected G free of self-loops and degree-one nodes. For a 
simple triangulated G, He, Kao, and Lu improved the bit count to |m + 0(1). For 
a simple G that is triconnected and thus free of degree-one nodes, they improved 
the bit count to at most 2.835m bits. This bit count was later reduced to at most 
^f^m + 0(l) « 2.378m + 0(1) by Chuang, Garg, He, Kao, and Lu @. These coding 
schemes all take linear time for encoding and decoding, but their bit counts are not 
information-theoretically optimal. For labeled planar graphs, Itai and Rodeh || gave 
an encoding of |nlogn + O(n) bits. For unlabeled general graphs, Naor [14| gave an 
encoding of \n 2 — n \ogn + 0(n) bits. 

For applications that require query support, Jacobson gave a 0(n)-bit encoding 
for a connected and simple planar graph G that supports traversal in O(logra) time per 
node visited. Munro and Raman improved this result and gave schemes to encode 
binary trees, rooted ordered trees, and planar graphs. For a general planar G, they used 
2m + 8n + o(m + n) bits while supporting adjacency and degree queries in 0(1) time. 
Chuang et al. || reduced this bit count to 2m + (5 + |)n + o(m + n) for any constant 
k > with the same query support. The bit count can be further reduced if only 0(1)- 
time adjacency queries are supported, or if G is simple, triconnected or triangulated 0. 
For certain graph families, Kannan, Naor and Rudich || gave schemes that encode 
each node with O(logra) bits and support 0(logn)-time testing of adjacency between 
two nodes. For dense graphs and complement graphs, Kao, Occhiogrosso, and Teng 
devised two compressed representations from adjacency lists to speed up basic graph 
search techniques. Galperin and Wigderson J| and Papadimitriou and Yannakakis |L5] 
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investigated complexity issues arising from encoding a graph by a small circuit that 
computes its adjacency matrix. 

Section § discusses the general encoding methodology. Sections |] and | use the 
methodology to obtain information-theoretically optimal encodings for various classes 
of planar graphs. Section |5] concludes the paper with some future research directions. 

2. The encoding methodology. Let \X\ be the number of bits in a binary string 
X. Let |G| be the number of nodes in a graph G. Let \S\ be the number of elements, 
counting multiplicity, in a multiset S. 

Fact 1 (see Let X t , X 2 , . . . , X k be 0(1) binary strings. Let n = \Xi\ + 

| J^2 1 + ■ ■ • + \Xk\. Then there exists an 0(logn)-bit string \, obtainable in 0(n) time, 
such that given the concatenation of x, Xi, X 2 , . . . , Xj., the index of the first symbol of 
each Xi in the concatenation can be computed in 0(1) time. 

Let X\ + X 2 + • — h Xk denote the concatenation of x, Xi, X 2 , . . . , Xk as in Fact |l[ 
We call x t ne auxiliary binary string for X\ + X 2 + • • ■ + X k . 

A graph with property tt is called a ir-graph. Whether two 7r-graphs are distinct 
or indistinct depends on tt. For example, let Gi and G 2 be two topologically non- 
isomorphic plane embeddings of the same planar graph. If tt is the property of being a 
planar graph, then G\ and G 2 are two indistinct 7r-graphs. If tt is the property of being 
a planar embedding, then G\ and G 2 are two distinct 7r-graphs. Let a be the number 
of distinct n-node 7r-graphs. Clearly it takes |~log 2 a] bits to differentiate all n-node 
7r-graphs. Let index7,-(0) be an |~log 2 a] -bit indexing scheme of the a distinct 7r-graphs. 

Let Go be an input n -node 7r-graph. Let A = log log log (no). The encoding algo- 
rithm encode 7r (G ) is merely a function call code 7r (G , A), where the recursive function 
code„-(0, A) is defined as follows: 

function code 7r (G, A) 

{ 

if |G| = 0(1) or |G| < A then 

return index w (G) 
else 

{ 

compute 7r-graphs G±, G 2 , and a string X, from which G can be recovered; 
return code 7r (Gi, A) + code n (G 2 , A) + X; 

} 

} 

Clearly, the code returned by algorithm encode„-(Oo) can be decoded to recover G . 
For notational brevity, if it is clear from the context, the code returned by algorithm 
encode„-(Go) (respectively, function code ff (G, A)) is also denoted encode,r(Go) (respec- 
tively, code 7r (G, A)). 

Function code„-(G, A) satisfies the separation property if there exist two constants c 
and r, where < c < 1 and r > 1, such that the following conditions hold: 
PI. max(|Gi|,|G 2 |) < \G\/r. 
P2. (Gil + |G 2 | = \G\ + 0(\G\ c ). 
P3. \X\ = 0(\G\ C ). 
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Let /(|G|) be the time required to obtain index 7r (G) and G from each other. Let 
g(|G|) be the time required to obtain Gi,G 2 ,X from G, and vice versa. 

Theorem 2.1. Assume that function code 7r (G, A) satisfies the separation property; 
and that there are at most 2^ n ) + ° < ^ (n ^ distinct n-node n-graphs for some continuous 
super- additive function /3(n). 

1. |encode 7r (G )| < /3(n ) + o(/3(n )) for any n -node n-graph G . 

2. If f(n) = 2™° (1) and g{n) = 0{n), then Go and encode^ Go) can be obtained 
from each other in O(nologno) time. 

Proof. The theorem holds trivially if no = 0(1). For the rest of the proof we 
assume n = w(l), and thus A = uj(1). Many graphs may appear during the execution 
of encode 7r (Go)- These graphs can be organized as nodes of a binary tree T rooted at 
Go, where (i) if G\ and G 2 are obtained from G by calling code 7r (G, A), then G\ and G 2 
are the children of G in T, and (ii) if \G\ < A, then G has no children in T. Further 
consider the multiset 5 consisting of all graphs G that are nodes of T. We partition 5 
into I + 1 multisets 5(0), 5(1), S(2), . . . , S(£) as follows. 5(0) consists of the graphs G 
with |G| < A. For % > 1, S(i) consists of the graphs G with r l-1 A < \G\ < r % \. Let 
G e 5(£), and thus set £ = 0(log f ). 

Define p = Eh€S(o) 1-^1- We first show 

(!) I^OK^T, 

for every i = 1, ...,£. Let G be a graph in S(i). Let 5(0,0) be the set consisting 
of the leaf descendants of G in T; for example, 5(0, Go) = 5(0). By Condition P|], 
|G| < Z)_ffes , (o,G) 1-^1- By Condition P(l|, no two graphs in 5(z) are related in T. Therefore 
5(z) contains at most one ancestor of H in T for every graph in 5(0). It follows 
that Ecesw \G\ < Eces(i) Eh G 5(o,g) l#l < P- Since |G| > r <_1 A for every G in 5(i), 
Inequality (P holds. 

Statement 1. Suppose that the children of G in T are G\ and G 2 . Let 6(G) = 
|A^| + Ixl) where x is the auxiliary binary string for code 7r (Gi,A) + code„-(G2,A) + 
X. Let q = Ei>iEGe5(i) & ( G )- Then ' |encode 7r (G )| = q + Ene5(o) |code 7r (i?, A)| < 
g + E/fes(o)(/5(l-^1) + °(/5(|-^|)))- By the super-additivity of /3(n), |encode„-(Go)| < 
9 + /3(p) + °(/3(p))- Since /3(n) is continuous, Statement 1 can be proved by showing 
P = n + o(n ) and q = o(n ) below. 

By Condition 1%, \X\ = 0(\G\ C ). By Fact |, |%| = 0(log|G|). Thus, b(G) = 
0(|G| C ), and 

(2) 9 = E E 0(|G| C ). 

Now we regard the execution of encode 7r (Go) as a process of growing T. Let a(T) = 
J2h is a loaf of t \H\. At the beginning of the function call encode 7r (Go), T has exactly one 
node G , and thus a(T) = n . At the end of the function call, T is fully expanded, and 
thus a(T) = p. By Condition P|2], during the execution of encode„-(Go), every function 
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call code 7r (G, A) with |G| > A increases a(T) by 0(|G| C ). Hence 

(3) P = n + J2 E 0{\G\% 

i>l GeS(i) 

Note that 

( 4 ) E E l G l C ^ XX^V^A) =pA c ~^E r(C_1)i =M C " 1 C(1) = o(p). 

i>l GeS(i) »>1 «>1 

By Equations @ and (f|), p = n + o(p), and thusp = O(n ). Hence J2i>i J2ces(i) \G\ C = 
o(no). By Equations @ and @), p = n + o(n ) and g = o(no), finishing the proof of 
Statement 1. 

Statement 2. By Conditions P0 and P|, |#| = Q(X) for every F £ 5(0). Since 
HffeSfo) 1-^1 — P — n o + o(n ), \S(0)\ = O(n /X). Together with Equation ([!]), we know 
\S(i)\ = for every i — 0, . . . ,£. By the definition of S(i), \G\ < r*A for every 

i — 0, . . . , £. Therefore Go and encode^ (Go) can be obtained from each other in time 

yO(/(A) + £ r-^A)). 

A i<«<^ 

Clearly /(A) = 2 A ° (1) = 2°( lo s lo s n °) = o(logn ). Since I = O(logn ) and g(n) = 0(n), 
J2i<i<e r ~ l g( rl ^) — J2i<i<e A = O(Alogn ), and Statement 2 follows. □ 

Sections |3] and [| use Theorem ^TT] to encode various classes of graphs G. Section ^| 
considers plane triangulations. Section [| considers planar graphs and plane graphs. 

3. Plane triangulations. A plane triangulation is a plane graph, each of whose 
faces has size exactly three. Note that a plane triangulation may contain self-loops and 
multiple edges. Every n-node plane triangulation, simple or not, has exactly 3n — 6 
edges. In this section, let ir be an arbitrary conjunction over the following groups of 
properties of a plane triangulation G: F|j, F|| and F^, where £3 and £4 are not required 
to be 0(1). Our encoding scheme is based on the next fact. 

Fact 2 (See ||12|| ). Let H be an n-node m-edge undirected plane graph, each of 
whose faces has size at most d. We can compute a node-simple cycle C of H in 0{n-\-m) 
time such that 

• C has at most 2y/dn nodes; and 

• the numbers of H's nodes inside and outside C are at most 2n/3, respectively. 
Let G be a given n-node 7r-graph. Let G' be obtained from the undirected version 

of G by deleting the self-loops. Clearly each face of G' has size at most four. Let C' be 
a cycle of G' having size at most 4-^/n guaranteed by Fact 0. Let G consist of the edges 
of G corresponding to the edges of C' in G' . Note that G is not necessarily a directed 
cycle if G is directed. Since G' does not have self-loops, 2 < |G| < Ay/n. If £4 > 2, then 
|G| can be two. Let Gj n (respectively, G out ) be the subgraph of G formed by G and the 
part of G inside (respectively, outside) G. Let x be an arbitrary node on G. 

Gi is obtained by placing a cycle C\ of three nodes outside G in and then triangulat- 
ing the face between C\ and G; n such that a particular node y\ of C\ has degree strictly 
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lower than the other two. Clearly this is doable even if |G| = 2. The edge directions of 
G\ — G m can be arbitrarily assigned according to n. 

G2 is obtained from G out by (1) placing a cycle C2 of three nodes outside G ou t and 
then triangulating the face between C2 and G out such that a particular node 2/2 of G2 
has degree strictly lower than the other two; and (2) triangulating the face inside C by 
placing a new node z inside of C and then connecting it to each node of C by an edge. 
Note that (2) is doable even if \C\ = 2. Similarly, the edge directions of G2 — G out can 
be arbitrarily assigned according to 71. 

Let u be a node of G. Let v be a node on the boundary B(G) of the exterior face 
of G. Define dfs(w, G, v) as follows. Let w be the counterclockwise neighbor of v on 
B(G). We perform a depth-first search of G starting from v such that (1) the neighbors 
of each node are visited in the counterclockwise order around that node; and (2) w is 
the second visited node. A numbering is assigned the first time a node is visited. Let 
dfs(w, G, v) be the binary number assigned to u in the above depth-first search. Let 
X = 6Ss(x,Gi,yi) + 6Ss(x, G 2 ,y 2 ) + dis(z,G 2 ,y 2 ). 

Lemma 3.1. 

1. G\ and G2 are it-graphs. 

2. There exists a constant r > 1 with max(|Gi|, IG2I) < n/r. 

3. |Gi| + |G 2 | = n + 0{^/n~). 

4. \X\ = O(logn). 

5. Gi,G 2 ,X can be obtained from G in 0(n) time. 

6. G can be obtained from G 1 ,G 2 ,X in 0{n) time. 

Proof. Statements [l|-|5] are straightforward by Fact and the definitions of Gi, G2 
and X. Statement |6] is proved as follows. It takes 0(n) time to locate y± (respectively, 
I/2) in G\ (respectively, G2) by looking for the node with the lowest degree on B(G\) (re- 
spectively, S(G 2 )). By Fact [1], it takes 0(1) time to obtain dfs(?/i, Gi, x), dfs(y 2) G 2 , x), 
and dfs(?/2, G2, z) from X. Therefore x and z can be located in G\ and G2 in 0{n) time 
by depth- first traversal. Now G m can be obtained from G\ by removing B(G±) and 
its incident edges. The cycle C in Gi n is simply B{G- m ). Also, G out can be obtained 
from G2 by removing S(G 2 ), 2, and their incident edges. The C in G out is simply the 
boundary of the face that encloses z and its incident edges in G2. Since we know the 
positions of x in G in and Gout, G can be obtained from G m and G out by fitting them 
together along C by aligning x. The overall time complexity is 0(n). □ 

Theorem 3.2. Let Go be an n^-node ix-graph. Then Go and encode 7r (Go) can 
be obtained from each other in O(nologn ) time. Moreover, |encode„-(Go)| < /3(n ) + 
o(/3(no)) for any continuous super- additive function /3(n) such that there are at most 
2/3(n)+o(/3(n)) di s fo nc t n-node n- graphs. 

Proof. Since an n-node 7r-graph has 0(n) edges, there are at most 

2 0(n log n) distinct 

n-node 7r-graphs. Thus, there exists an indexing scheme index^G) such that index 7r (G) 
and G can be obtained from each other in 2' G ' otl) time. The theorem follows from 
Theorem |2.1| and Lemma |3.1| . □ 



4. Planar graphs and plane graphs. In this section, let 7r be an arbitrary 
conjunction over the following groups of properties of G: Fjl|, F§, F^|, F|^, and F[7[ 
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Clearly an n-node 7r-graph has 0(n) edges. 

Let G be an input n-node 7r-graph. For the cases of ir being a planar graph rather 
than a plane graph, let G be embedded first. Note that this is only for the encoding 
process to be able to apply Fact |. At the base level, we still use the indexing scheme 
for 7r-graphs rather than the one for embedded 7r-graphs. As shown below, the decoding 
process does not require the 7r-graphs to be embedded. 

Let G' be obtained from the undirected version of G by (1) triangulating each 
of its faces that has size more than three such that no additional multiple edges are 
introduced; and then (2) deleting its self-loops. Let C be a cycle of G' guaranteed by 
Fact 0. Let C consists of the (a) edges of G corresponds to the edges of C in G', and 
(b) the edges of C that are added into G' by the triangulation. (C is not necessarily a 
directed cycle of a directed G.) Let Gq be the union of G and C. Let G in (respectively, 
Gout) be the subgraph of Gq formed by C and the part of Gc inside (respectively, 
outside) C. Let C = x\x 2 • ■ -xexe+i, where xe+i = x\. By Fact EL £ — 0(y/n). 

Lemma 4.1. Let H be an 0{n)-node 0(n)-edge graph. There exists an integer k 
with n 0,6 < k < n 0J such that H does not contain any node of degree k or k — 1 . 

Proof. Assume for a contradiction that such a k does not exist. It follows that the 
sum of degrees of all nodes in H is at least (n a6 + n°' 7 )(n - 7 — n 0,6 )/4 = Q(n 1A ). This 
contradicts the fact that H has 0{n) edges. □ 

Let Wk, with k > 3, be a k-wheel graph defined as follows. As shown in Figure [l], Wk 
consists of k + 1 nodes w , Wi, w 2 , ■ ■ ■ , Wk-i, w k, where Wi, w 2 , ■ ■ ■ , Wk, W\ form a cycle. 
W7 is a degree-fc node incident to each node on the cycle. Finally, W\ is incident to 
Wk-i- Clearly Wk is triconnected. Also, w\ and Wk are the only degree-four neighbors 
of wo in Wk- Let k\ (respectively, k 2 ) be an integer k guaranteed by Lemma [11] for G- m 
(respectively, G out ). Now we define G\, G 2 and X as follows. 

Gi is obtained from G in and a fci-wheel graph by adding an edge [w^Xi) for 
every i — 1, . . . , I. Clearly for the case of 7r being a plane graph, G\ can be embedded 
such that Wk x is outside G m , as shown in Figure 0(a). Thus, the original embedding 
of Gj n can be obtained from G\ by removing all nodes of Wk 1 ■ The edge directions of 
G\ — G m can be arbitrarily assigned according to tt. 

G 2 is obtained from G out and a k 2 -whee\ graph Wk 2 by adding an edge (wi,Xi) for 
every i — !,...,£. Clearly for the case of n being a plane graph, G 2 can be embedded 
such that Wk 2 is inside C, as shown in Figure H(b). Thus, the original embedding of 
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(a) Gi (b) G 2 

Fig. 2. G\ and G2. The gray area of Gi is Gj n . The gray area 0/G2 is G ou t- 



G out can be obtained from G 2 by removing all nodes of Wk 2 - The edge directions of 
G 2 — G out can be arbitrarily assigned according to ir. 

Let X be an 0( v /n)-bit string which encodes ki, k 2 , and whether each edge (x i: x i+ i) 
is an original edge in G, for i = 1, . . . , I. 

Lemma 4.2. 

1. G\ and G 2 are ix -graphs. 

2. There exists a constant r > 1 with max(|Gi|, |G 2 |) < n/r . 

3. \G X \ + \G 2 \ = n + 0(n°- 7 ). 
I \X\=0(y/n). 

5. G\,G 2 ,X can be obtained from G in 0(n) time. 

6. G can be obtained from Gi,G 2 ,X in 0{n) time. 

Proof. Since and Wk 2 are both triconnected, and each node of C has degree at 
least three in G\ and G 2 , Statement holds for each case of the connectivity of the input 
7r-graph G. Statements |2|-[5] are straightforward by Fact @ and the definitions of G\, G 2 
and X. Statement [| is proved as follows. First of all, we obtain k\ from X. Since G in 
does not contain any node of degree k\ or k\ — 1, wo is the only degree-/^ node in G\. 
Therefore it takes 0(n) time to identify wq in G\. is the only degree-3 neighbor of 
Wq. Since k\ > £, W\ is the only degree-5 neighbor of Wq. w 2 is the common neighbor 
of Wq and W\ that is not adjacent to w^. From now on, w i} for each i = 3,4, . . . ,£, 
is the common neighbor of Wo and u>j_i other than Wi- 2 . Clearly, w\, w 2 , . . . , we and 
thus Xi, x 2 , . . . , xe can be identified in 0(n) time. G in can now be obtained from G\ by 
removing W^. Similarly, G out can be obtained from G 2 and X by deleting after 
identifying xi, x 2 , . . . , X£. Finally, Gc can be recovered by fitting G in and G out together 
by aligning x±, x 2 , ■ ■ ■ , X£. Based on X, G can then be obtained from Gc by removing 
the edges of C that are not originally in G. □ 
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Remark. In the proof for Statement of Lemma |47|, identifying the degree-fci node 
(and the fci-wheel graph W^) does not require the embedding for G\. Therefore the 
decoding process does not require the 7r-graphs to be embedded. This is different from 



the proof of Lemma 3.1 



Theorem 4.3. Let Gq be an n -node ir-graph. Then G and encode^-(Go) can 
be obtained from each other in O(nologn ) time. Moreover, |encode 7r (Go)| < P{ n o) + 
o((3(na)) for any continuous super- additive function f3(n) such that there are at most 
2/3(n)+o(/3(n)) ^j s £j nc £ n-node ir -graphs. 

Proof. Since there are at most 2°( rilogn ) distinct n-node 7r-graphs, there exists an 
indexing scheme index 7r (G) such that mdex n (G) and G can be obtained from each other 



in 



2' G ' time. The theorem follows from Theorem 12.11 and Lemma (4.2L □ 



5. Concluding remarks. For brevity, we left out F|4] and F[5] in Sections |3] and f|. 
One can verify that Theorems |3.2| and |4.3| hold even if 7r is a conjunction over and 
F0 

The coding schemes given in this paper require O(nlogn) time for encoding and 
decoding. An immediate open question is whether one can encode some graphs other 
than rooted trees in 0(n) time using information-theoretically minimum number of 
bits. It would be of significance to determine whether the tight bound of the number 
of distinct 7r-graphs for each 7r is indeed continuous super-additive. 
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